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1.  INTRODUCTION 


This  report  describes  die  data  sources,  the  data  processing  activities  (data  extraction,  organization, 
compression,  and  prediction),  and  the  blending  of  sky/cloud  cover  statistics  which  are  vital  to  the 
development  of  an  automated  fast  retrieval  global  Climatology  of  Cloud  Statistics  (C  Cloud  S)  The 
material  presented  supersedes  the  work  presented  in  an  interim  report  by  Willand  and  Steeves  ( lyvi).  It 
begins  with  a  brief  description  of  the  overall  C  Cloud  S  system.  The  sections  that  follow  describe  in 
detail  the  processing  and  blending  of  the  climatological  databases  chosen  to  represent  realistic  glcbal  cloud 
amount  statistics  within  an  automated  C  Cloud  S  program. 

2.  C  CLOUDS  S  SYSTEM  OVERVIEW 

C  Cloud  S  is  based  on  many  years  of  cloud  observations.  It  does  not  provide  observed  values  for  a  given 
date  and  time  in  the  past  and  therefore  is  not  a  historical  database.  Instead  it  is  intended  to  be  used  directly 
as  either  input  to  simulation  models  or  to  provide  a  synthetic  series  ol  cloud  or  cloud-free  Ime-of-sight 
predictions  that  have  realistic  probability  distributions  and  are  realistically  correlated  in  time  and  space. 

An  overview  of  the  complete  C  Cloud  S  system  is  portrayed  in  Figure  I .  As  shown,  the  system  is  divided 
into  three  subdivisions:  Data  Input,  Compact  Storage,  and  Cloud  Statistics  plus  or  minus  Error. 

2.1  DATA  INPUT 

Data  input  to  the  C  Cloud  S  system  consists  of  four  climatological  data  ensembles 

1  The  Burger  cloud  climatology  from  land  observations 

2  Department  of  Energy  (DOE)  cloud  climatology  from  land  ?.nd  sea  surface  cloud 
observations 

3  The  NIMBUS-7  satellite  CMATRIX  cloud  climatology  and 

4  A  subset  of  the  DATSAV  climatology  that  provided  hourly  sequences  of  surface  sky 
cover  observations  used  for  deriving  temporal  and  spaual  correlations  of  cloud  covers 
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Figure  I.  C  Cloud  S  System  Overview  from  Star!  to  Finish. 


Spatial  and  temporal  attributes  of  these  climatic  databases  are  summarized  in  Table  1.  Note  that  none  of 
the  spatial  or  temporal  attributes  of  the  databases  are  the  same.  For  example.  DOE/NCAR  data  arc  mapped 
into  a  5  x  5  degree  grid  system,  whereas  NIMBUS-7  data  arc  in  a  2070  Earth  Radiation  Budget  target  area 
grid.  Burger  data  and  DATSAV  data  are  sky  dome  point  climatologies  gridded  randomly.  DOK  data  are 
stratihed  by  season  at  every  three  hours  GMT  while  NIMBUS-7  data  are  archived  daily  at  noon  and 
midnight  MAST  (mean  apparent  solar  time).  Burger  data,  on  the  other  hand,  are  mid-seasonal  monthly 
means  and  histograms  ol  three  hourly  mean  sky  covers  centered  over  four  local  standard  times  of  day. 
DATSAV  data  are  mostly  archived  24  hours  daily  in  GMT.  Finally,  Burger,  DOF,  and  DATSAV  data 
archive  sky  cover  amounts  that  are  estimated  by  surface  weather  observers  while  the  NIMBUS-7  data 
archives  cloud  amounts  as  observed  from  space.  (Sky  cover  is  defined  as  the  amount  of  a  sky  dome  area 
covered  by  clouds  as  estimated  within  the  perspective  of  an  up-looking  device  or  a  ground  observer.  Cloud 
cover  on  the  other  hand  is  the  amount  of  earth  covered  by  clouds  within  a  given  field  of  view  as  observed 
from  the  down-looking  perspective  of  an  earth  viewing  satellite.) 


Table  1.  Summary  Table  of  Cloud  Databases  for  C  Cloud  S  1.0 


Database 

DOE/NCAR 

NIMBUS-7 

Burger 

DATSAV 

Source 

Surface 

Sun 

Surface 

Surface 

Observed 

synchronous 

Observed 

observed 

sky  covers 

satellite 

sky  covers 

sky  covers 

within  an 

derived 

over 

over 

area. 

cloud 

stations. 

stations. 

cover, 

Area 

555  x  555  km 

500  x  500  km 

sky  dome 

sky  dome 

Resolution 

-  (5  x  5)  deg 

Points/Map 

1820 

2070 

2300 

92 

Yearly 

4  seasons 

Daily 

Mid- 

Hourly 

Stratified 

seasonal 

monthly. 

Times  of 

3  hourly 

Twice/day, 

4  hours  LST 

24  hours 

Day 

0,3, ...GMT. 

00-12  MAST. 

1,7,13,19. 

GMT. 

Period  of 

Land,  11 

6  years, 

29  years, 

11  years, 

Record 

years, 

1979-1985. 

1945-1973. 

1973-1983 

1971-1981. 

or 

Ocean,  54 

10  years, 

years, 

1930-1983 

1977-1986 
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2.1.1  Data  Extraction 


Data  extraction  methods  were  initialed  to  extract  Iront  these  climatological  data  ensembles  the  mean 
sky/doud  cover  populations  and  their  variances  and  interannual  standard  deviations  Computed  mean 
correlations,  defined  below,  were  also  assembled  using  mean  sky/doud  covers  and  variances  extracted  lor 
each  of  the  points  encountered  within  a  dataset.  (Parameterization  of  sky  cover  distributions  over  selected 
locations  can  be  generated  by  combining  the  two  parameters  of  mean  sky  cover  and  mean  correlation.  The 
algorithm  for  the  parameterization  can  be  found  in  Boehm  ( 1992).)  All  extracted  and  computed  parameters 
including  date-time  groups  and  latitude-longitude  locators  were  stored  in  a  convenient  standard  format  for 
subsequent  processing 

Mean  correlauon  is  defined  as  the  mean  of  the  correlations  of  given  conditions  (such  as  sky/cloud 
coverage)  between  all  pairs  of  points  in  a  domain  where  a  domain  can  be  either  a  line,  an  area,  or  a 
volume 

2.1.2  Weighted  Spectral  Analysis 

A  weighted  spectral  analysis  program  was  developed  and  utilized  to  compress  the  extracted  data  into  the 
form  of  spatial  coefficients  using  a  Fourier  analysis  in  longitude  and  a  fully  normalized  associated 
Legendre  function  in  lautude.  The  coefficients  derived  are  used  to  predict  five  basic  parameters  given  the 
coordinates  of  any  location  on  earth.  These  five  parameters  are  mean  sky  cover,  mean  sky  dome 
correlation,  population,  interannual  standard  deviation  of  mean  sky  cover,  and  satellite  mean  correlation 
A  Founer  senes  can  be  used  to  predict  cloud  amount  statistics  for  any  ume  of  year  and  time  of  day 

2.2  COMPACT  STORAGE 

The  number  of  sets  of  compact  storage  coefficients  for  each  of  the  five  selected  parameters  arc  listed 
below 

1  Blended  mean  sky/cloud  cover  statistics  from  Burger,  DOH,  and  NIMBUS-7  data.  (12 
months  x  8  times  or  96  sets) 

2.  Mean  Correlations  over  a  sky  dome  as  computed  from  Burger  and  DOE  data.  (12  months 
x  8  times  or  9ft  sets) 
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3.  Populations  derived  Prom  blending  Burger.  DOE,  and  NIMBUS-7  data  sample  si/.es.  <  12 
months  x  8  times  or  9.3  sets.) 

4.  Annual  standard  deviations  of  sky  cover  statistics  derived  from  DOE  data.  (12  months  or 
12  sets.) 

5.  Satellite  mean  correlations  derived  from  NIMBUS-7  data.  (12  months  x  2  times  or  24 
sets.) 

The  sets  of  coefficients  were  spooled  onto  floppy  disks  for  use  in  the  development  of  an  automated  cloud 
climatology  to  be  housed  on  small  personal  computers.  Regional  spatial  and  temporal  correlations  of  sky- 
cover  conditions  over  selected  I)  ATS  A  V  data  stations  were  plotted  up  and  used  to  define  equations  for 
predicting  joint  cloud  amount  events  within  the  automated  cloud  climatology  program 

2.3  CLOUD  STATISTICS  PLUS  OR  MINUS  ERROR 

In  summary,  C  Cloud  S  (Climatology  of  Cloud  Statistics)  is  a  compact  global  cloud  climatology  residing 
on  an  '  compatible  computer.  The  basic  databases  are  derived  from  NIMBUS-7  satellite  data. 
D0E7NCAR  surface  data,  and  DoD  surface  data.  Cloud  models  based  on  special  datasets  are  used  to 
calculate  derived  slaustics  The  following  staustics  together  with  estimated  emor  are  available  for  any  point 
or.  the  globe,  any  lime  of  year,  any  time  of  day: 

SKY  COVER  STATISTICS 

1 .  Probability  of  Fractional  Sky  Cover 

2.  Duration  of  Sky  Cover  Category 

3.  Recurrence  of  Sky  Cover  Category 

4.  Conditional  Climatology  of  Sky  Cover. 

CLOUD-FREE  LINE-OF-S1GHT 

5.  Probability  of  CFLOS 

6  Duration  of  CFLOS 

7.  Recurrence  of  CFLOS 

8  Joint  Probability  of  CFLOS  at  N  Sites 
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SATELLITE-BASED  VIEWING 
9  Cloud  Coverage  Over  an  Area 

10.  Cloud  Coverage  Over  a  Lino 

11.  Probability  of  LOOK/SEE 

DATA  BASE  PARAMETERS 

12.  Mean  Sk'  ..over 

13.  Sky  Do  Mean  Correlation 

14  Large  .rea  Mean  Correlation 

15  Effective  Period  of  Record 

Output  ib  in  the  form  of  numbers,  graphs,  histograms,  or  maps.  Output  can  be  sent  lo  the  screen,  to  a 
printer,  or  to  a  file  for  use  by  other  computer  programs.  Limitations:  C  Cloud  S  is  limited  to  total  sky 
cover,  not  clouds  at  given  altitudes. 

2.4  CLIMATOLOGICAL  DATABASES  ANO  DATA  PROCESSING  ACTIVITIES 

The  following  section  and  sections  3  through  7  describe  the  climatological  databases  used  for  input  to  C 
Coud  S  Also,  data  processing  activities  such  as  data  extraction,  organizanon,  interpolation,  analysis, 
compression,  and  blending  of  these  data  ensembles  are  described  in  detail.  Row  diagrams  arc  included 
to  demonstrate  hew  data  sources  are  combined  with  developed  software  to  produce  end  results.  These 
diagrams  all  use  i.  common  form  of  symbols  defined  in  Table  2.  The  diagrams  show  daia  sources  used 
hy  data  transfer  software  developed  to  formal,  compress,  and  transfer  data  lo  permanent  files.  Backup 
tapes  and  labels  record  where  all  data  for  each  project  are  kept  for  permanem  reiention.  Most  of  the  data 
processing  activities  subsequently  described  were  performed  on  a  CDC  6800  CYBER  NOS  system  or  a 
CONVEX  UNIX  system  The  software  was  written  in  standard  FORTRAN  5.  Several  FORTRAN 
subroutines  and  functions  critical  to  (he  processing  effort  are  listed  in  Appendix  A. 

3.  BURGER  DATA 

The  Burger  daia  (Burger,  1985)  were  compiled  mainly  from  the  Revised  Uniform  Summaries  of  Surface 
Weather  Observations  (RUSSWO's),  National  Intelligence  Surveys  (NIS),  and  NAV ATLAS  records.  The 
resulting  data  ensemble  contains  station  names,  latitude-longitude  locators,  fractional  sky  cover  histograms, 
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Table  2.  Symbol  Definitions  Used  in  Data  Processing  Configuration  Diagrams 


Magnetic  Tape  Data  Storage 


Pf?OC  . 


> 


Procedure  Deed  for  Batch  Processing 


I  PROGPOn 

I _ 


Computer  Prograa  (FORTRAN) 


PROOUCT 


Resulting  Product  or  Other  Programs 


Account  Number 

Permanent  Disk  Data  Storage 


Floppy  Disk  Storage 


mean  sky  covers,  scale  distances  (Burger.  1985),  and  in  some  cases,  period  of  record  for  over  2,(XK) 
weather  reporting  stations  around  the  globe.  The  data  are  archived  for  the  four  mid-season  months  of 
January,  April,  July,  and  October  Three  hourly  averages  of  sky  cover  are  centered  at  the  four  local 
standard  times  of  01, 07,  13,  and  19  I.ST.  Data  over  the  ocean  areas  are  archived  for  the  same  four  mid¬ 
season  months,  but  sky  cover  values  are  monthly  means.  Willand  (1988)  revised  the  original  digitized 
version  of  the  Burger  sky  cover  archive  of  RUSSWO's,  NIS,  and  NAV ATLAS  data  into  a  more  compact 
compatible  format.  The  data  are  now  stored  in  three  separate  files  called  RUSARC,  FINARC.  and 
SEAARC,  respectively.  Subsequently  the  revised  dataset  was  used  in  the  development  of  a  prototype 
automated  global  cloud  climatology  (CLOUDZ),  Boehm  and  Willand  (1988).  A  detailed  description  of 
the  processing  of  the  data  required  for  the  development  of  the  latest  version  of  the  automated  climatology 
of  cloud  statistics  is  discussed  next. 

3.1  PROCESSING  OF  BURGER  DATA 

The  data  flow  configuration  for  processing  Burger  data  is  shown  in  Figure  2.  The  FINARC  data  portion 
of  the  Burger  data  were  mainly  uulized  in  the  following  processing  efforts.  Except  for  a  few  selected 
southern  hemisphere  stations,  the  RUSARC  and  SEA  ARC  portions  of  the  Burger  data  were  excluded  from 
the  processing  because  of  possible  overlap  with  the  DOF,  climatology  (discussed  in  Section  4.0) 

3.1.1  Program  SDTORO 

Initially,  FINARC  data  were  spooled  from  backup  tape  M 14555  on  to  permanent  disk  file.  The  procedure 
XEQSDT  was  then  used  to  call  on  program  SDTORO  which  read  the  data  and  convened  the  scale 
distance  parameters  to  mean  correlations  (see  subroutine  SKYRO,  Appendix  A).  Missing  sample  sizes  for 
stations  showing  climatological  sky  cover  values  were  set  to  450  observations,  or  5  years  x  30  days  x  3 
hours.  The  resulting  data  together  with  date-lime  information  and  locators  were  written  to  permanent  disk 
files  in  a  standard  format  designed  to  be  utilized  in  interfacing  with  other  datasets.  Thus,  a  date  time 
group,  latitude-longitude,  mean  sky  cover,  standard  deviation,  mean  correlation,  and  population  count  lor 
each  stauon  for  each  separate  month  and  time  period  were  written  to  permanent  disk  storage  using  the 
following  format; 

WRITE(NTAPE,  100)ITC,ISEA,ITM,ITIM,LAT,LON,M,S,RO,P 

100  FORMAT(I3,I2,ll,l2,Ffi  2,F7.2,3Ffi.3,I8)  (3.1.1) 
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Figure  2.  Flow  Diagram  for  Processing  Burger  FINARC  Sky  Cover  Data  into  a  Form  Suitable  lor  Merging  with  Other  Data  files. 


Variables  are  defined  as 


!TC;  Data  type  code 

iSF.V  Month/Season  code. 

ITM:  Time  Code. 

I  TIM:  Time.  00,03,06, ..  21. 

I. AT:  Latitude.  -90  <=  L.AT  <=  90  (REAL) 

LON:  Longitude.  0  to  360  degrees  east.  (REAIA 

M:  Mean  Coud  cover  probability  0.0  <=  M  <=  1.0  (REAL) 

S:  Standard  deviation.  0.0  <=  S  <=  1.0 

RO:  Mean  correlation  0.0  <=  RO  <=  1.0 

P  Population 

The  convened  data  were  written  to  account  number  3  disk  storage  under  names  like  BJAN01,  07,  13,  19 
and  BAPR01,  etc.,  meaning  Burger  data  for  January,  April,  July,  or  October  at  01,  07,  13,  or  19  LST 
respectively. 

3.1.2  Program  INFIN 

INFIN  uses  linear  interpolation  methods  to  interpolate  the  data  written  by  SDTORO  to  8  times  ol  day 
beginning  with  00  LST.  Thus,  resulting  data  was  again  written  to  permanent  files  only  this  lime  v  till 
names  like  BJANOO,  03,  Of.  ...21  LST  or  BAPR(X),  03.  06,  etc.  Tire  final  datasets  were  spooled  to  tape 
M 14225  for  permanent  retention.  The  INFIN  process  was  necessary  in  order  to  obtain  data  in  a  form 
suitable  for  merging  wiih  the  DOE  data  discussed  in  Section  4.0. 

4.  DOE/NCAR  DATA 

The  "Climatological  Data  for  Clouds  Over  the  Globe  from  Surface  Observations"  compiled  by  Hahn 
(1987)  was  another  important  sky  cover  database  used  in  the  development  of  C  Cloud  S.  This  cloud 
database  was  prepared  and  archived  for  use  in  several  U  S.  Government  energy  agencies,  including  the 
United  States  Department  of  Energy  (DOE)  and  the  National  Center  for  Atmospheric  Research  (NCAF '. 
The  database  contains  global  maps  (90  deg  N  to  90  deg  S)  of  long-term  monthly  and/or  seasonal  total  vVy 
cover,  cloud  type  amounts  and  frequencies  of  occurrence,  low  cloud  base  heights,  harmonic  analysis  t 
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annual  and  diurnal  cycles,  interannual  variations  and  trends,  and  cloud  typo  co-occurrences  Most  rf  the 
data  are  mapped  on  grids.  Kach  grid  box  represents  a  5-degree  latitude  by  5-degree  longitude  resolution 
equaiorward  of  50  degrees  latitude.  Poleward  of  50  degrees,  the  longitudinal  length  of  the  boxes  is 
increased  to  maintain  approximate  equal  area  in  each  box.  According  to  Hahn  (1987),  data  from  all 
available  stations  wilhin  a  grid  box  were  used  to  eompuie  the  average  for  that  box.  The  data  were  archived 
so  that  land-observed  sky  cover  data  were  stored  separately  from  ocean-observed  data.  The  period  of 
record  for  land  observations  is  the  II  years  from  January  1971  through  December  1981.  The  period  of 
record  for  ship  observations  is  Ihe  54  years  from  January  19V)  through  November  19K.V  Portions  o|  ihc 
l)()H  data  archive  arc  available  in  the  form  ol  aliases  (Warren  et  al  .  1986),  and  considerable  additional 
information  is  available  on  a  single  magnetic  tape 

4.1  PROCESSING  OF  DOE  DATA 

The  computerized  system  for  processing  the  DOE  data  is  depicted  in  Figure  3.  The  portion  of  l)()h  data 
selected  to  represent  sky  cover  staustics  within  the  C  Cloud  S  system  was  that  archived  at  8  times  of  day 
and  averaged  over  the  four  seasons  Winter,  Spring.  Summer,  and  Fall  These  statistics  provided  valuable 
information  for  defining  and  predicting  diurnal  cycles  of  global  sky  cover 

4.1.1  Program  FETCH 

Program  FETCH,  which  is  invoked  by  the  procedure  XEQFET,  extracted  global  maps  of  selected  cloud 
statistics  from  separate  files  of  the  DOE  cloud  climatology  tape.  The  files  had  been  assigned  map  group 
numbers  specified  by  Hahn  (1987).  Specifically,  selected  global  maps  of  mean  sky  cover  for  four  seasons 
at  eight  GMT's  daily  over  land  areas  were  extracted  from  file  2  under  map  group  numbers  14  through  45. 
Ocean  data  for  the  same  parameter  were  extracted  from  file  6  under  map  group  numbers  1080  through 
Mil.  The  extracted  data  were  stored  under  the  names  NCI  080,  NCI  096,  etc  The  parameters  stored  for 
each  box  encountered  were  mean  sky  covers  and  standard  deviations,  populations,  central  box  laiuude- 
longitude  reference  points,  and  date-time  information. 


1 1 


Data  Flow  Configuration  Used  in  Processing  the  DOF  Seasonal  Cloud  Databases. 


4.1.2  Program  MLO 


Procedure  XEQM1 .()  calls  on  program  MI.O  in  order  to  merge  DOK  land  observations  of  mean  sky  cover 
with  ocean  observed  mean  sky  conditions.  During  the  merging  process,  the  standard  deviation  of  mean 
sky  cover  tor  each  5 -degree  x  5-degree  grid  box  was  adjusted  by 


n 


(I) 


where 

s'=  DOE  standard  deviation  ol  mean  sky  cover 
at  a  grid  point 
n  =  sample  size 
s  =  adjusted  standard  deviation 

In  addition,  mean  correlation  was  computed  using  the  given  mean  sky  cover  amount  and  the  adjusted 
standard  deviation  squared  for  each  grid  point  as  input  to  an  accurate  tetrachoric  correlation  function  like 
that  listed  in  Appendix  A  and  described  in  Smyth  (1991).  Normally  the  four  arguments  A,  B,  C.  and  D 
sent  to  this  function  represent  normalized  counts  in  a  fourfold  contingency  table  of  dichotomous  events 
However,  in  order  to  compute  the  mean  correlation  for  each  box  encountered  these  arguments  were  set 
to  the  following  computed  values. 


A  =  V  +  M  x  M 

(2) 

B  =  M-A 

(3) 

C  =  B 

(4) 

D  =  1.0-A-2C 

(5) 

where 

M  =  Mean  sky  cover 
V  =  Variance 

then 

Mean  Correlation  =  TKTRA(A,B,C.P) 
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A  dale  lime  group,  latitude -longitude,  mean  sky  cover,  standard  deviation,  mean  correlation,  and 
population  count  for  each  grid  point  for  each  separate  season  and  tune  period  were  written  to  permanent 
disk  storage  with  the  standard  format  shown  in  3. 1. 1.  Names  such  as  WINTOO  (winter  at  (X)  (iMT)  were 
assigned  to  the  data  files  A  total  of  eight  files  at  eight  GMT's  lor  each  of  four  seasons  were  assembled 

4.1.3  Program  REORG 

Program  REORG  initiated  by  pnxedure  XEQREO  was  implemented  to  reorganize  the  datasets  created  by 
program  MLO  and  thus  prepare  them  for  subsequent  interpolation  The  8  files  for  each  separate  GMT  for 
each  season  were  extracted,  reordered  and  stored  in  the  format  4  I  below, 

WRITF.(NTAPE.400)YLAT.XLON,(XM(1T).IT=I.X).(XR(IT),IT=I,8). 

X  (IP(IT),1T=1,X) 

400  FORMAT<IX,F5.1,F6  1.2(8F5  3).XI5)  (4  1) 

where 

YLAT  =  Latitude 

XLON  =  Longitude 

XM  =  Mean  sky  cover 

XR  =  Mean  correlation 

IP  =  Populauon 

IT  =  Time  index  1  through  8. 

“Tie  reordered  datasets  were  named  TW1NT,  TSPRG,  TSUMM,  and  TFALL  meaning  winter,  spring, 
summer,  and  fall  respectively.  These  newly  formed  data  values  were  then  better  suited  for  use  in  the  time 
series  interpolation  process  described  next. 

4.1.4  Program  TS 

Procedure  XEQTS  calls  on  program  TS  (Time  Series  interpolation)  to  linearly  interpolate  missing  mean 
sky  cover,  mean  correlation,  and  population  values  within  the  reorganized  DOE  dataset  For  instance, 
many  ot  the  DOE  data  values  were  archived  every  four  hours  GMT  (e  g  ,  00,  Oft,  12,  and  18Z),  bui  often 
data  for  the  in  between  times  (e.g.,  03,  09,  15,  and  21Z)  were  missing.  The  interpolation  procedure  to  fill 
in  these  missing  data  values  provided  a  more  stable  da>  j,et  for  use  with  a  more  robust  Fourier 
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interpolation  scheme  (Section  4.1.5)  used  to  convert  the  entire  data  ensemble  in  GMT  to  mean  apparent 
sun  time,  MAST.  The  final  TS  interpolated  data  ensembles  were  again  stored  on  separaie  files  for  each 
of  the  four  seasons.  The  tiles  were  given  names  such  as  TWIN,  TSI’R.  TSUM,  and  THAI.. 


4.1.5  Program  T9090 


The  data  values  within  the  DOF  dataset  at  this  point  of  the  data  processing  activity  were  all  representative 
of  Greenwich  Mean  Time  (GMT).  In  the  development  of  an  automated  cloud  climatology  it  is  more 
desirable  to  have  the  values  representative  of  Mean  Apparent  Solar  Time  (MAST)  Therefore,  program 
T9090  was  developed  to  transform  the  DOE  data  from  GMT  to  MAST  The  data  ensembles  created  by 
program  TS  were  used  as  input.  The  T9090  program  then  utilized  a  Fourier  series  in  lime  to  transform 
the  data  from  GMT  to  MAST. 

The  procedure  begins  by  sending  each  8  hour  set  of  values  for  a  given  parameter,  (e  g.,  mean  sky  cover, 
mean  correlation,  or  population  for  a  given  station  within  a  given  season)  through  an  analysis  routine  that 
uses  Cholesky  regression  initially  and  then  a  Founei  series  to  create  X  coefficients.  These  coefficients  were 
then  used  to  predict  the  value  of  a  given  parameter  for  any  given  lime  (GMT)  The  GMT  used  for 
prediction  at  a  given  point  and  MAST  was  computed  using  6. 

GMT  =  -  24  Hrs/360  Deg  x  ELONG  +  MAST  (6) 

where 

ELONG  =  Longitude  (Degrees  East) 

Tests  were  then  made  to  ensure  that  GMT  remained  within  a  24-hour  cycle.  Tests  were  also  made  on  the 
magnitudes  of  predicted  population  values  in  order  to  keep  them  within  tolerable  limits  for  numerical 
stability.  The  maximum  predicted  population  value  set  for  the  winter  season  was  990  observations,  spring 
and  summer  was  1012,  and  fall  was  1001. 

This  process  was  repeated  for  every  point  in  a  DOE  grid  system  for  each  parameter  creaung  a  synthesized 
data  file  at  every  three  hours  MAST.  These  data  files  consisted  of  a  date  ume  group  (DTG),  latitude- 
longitude  locators,  mean  sky  cover  (M),  standard  deviation,  mean  correlation  (R),  and  population  tl*)  for 
winter  (W),  spring  (G).  summer  (S),  and  fall  (F)  in  the  standard  format  shown  in  3.1.1.  Names  such  as 
WMRP200  were  assigned  to  each  file.  Here,  2  represents  the  version  number  and  00  the  MAST 
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Note  that  at  this  point,  the  interpolated  Burger  FIN  ARC  data  discussed  in  Section  3  were  merged  with  the 
resulting  MAST  version  of  the  DOE  data. 

4.1,6  Program  WESPAN 

The  merged  DOE/BURGER  datasets  in  MAST  were  transferred  to  a  UNIX  computer  system  for 
subsequent  processing  through  an  analysis  program  called  WESPAN,  WEighted  SPectral  ANalysis.  The 
program  was  initiated  by  a  Bourne  shell  script  called  "RUN"  that  resides  on  "STDIN". 

4.1 .6.1  WESPAN  Weighted  Spectral  Analyeis 

Spectral  analysis  is  accomplished  by  using  a  weighted  spectral  analysis  in  longitude  with  fully  normalized 
associated  Legendre  functions  in  latitude.  Since  data  is  irregularly  spaceJ  (and  weighted),  the  weighted 
correlation  matrix  is  inverted  rather  than  separately  calculating  coefficients  which  assume  orthogonality 
between  the  spectral  analysis  functions.  Note  that  weights  are  r.ot  applied  when  doing  the  analysis  of 
population  values.  Values  of  mean  sky/cloud  cover  and  mean  correlation  are  converted  to  equivalent 
normal  deviates  before  performing  the  calculations  described  below. 

The  variance  of  the  mean  value  is  proportional  to  the  number  of  cases  regardless  of  the  time  correlation 
structure.  Thus  the  weights  are: 

w,  =  N,  (7) 

where  w,  is  the  weight  at  the  ith  location  and  N,  is  the  number  of  days  of  data  at  the  same  location.  N, 
in  this  case  is  not  allowed  to  be  greater  than  the  population  limits  set  in  Section  4.1.5.  These  limits  are 
well  within  bounds  since  climate  studies  show  that  the  variance  of  the  mean  does  not  grow  much  smaller 
after  about  15  years  of  data  for  a  given  month  (15  years  x  31  days  =  465). 

The  analysis  described  here,  except  for  the  inclusion  of  weights,  is  much  like  any  other  curve  fitting 
technique  where  lots  of  data  are  compressed  into  coefficients  which  are  subsequently  used  to  predict 
certain  parameters  within  some  degree  of  accuracy  at  a  given  state.  Thus  given  that  L  is  the  matrix  of 
locations  and  N  is  the  number  of  locations: 
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L  = 


latv 

lont 

toj. 

lon2 

lof*. 

: 

bnN 

(S) 


and  V  is  the  vector  of  values  (e  g.,  mean  sky  cover,  correlation  or  population)  at  each  point 


and  finally  W  is  the  diagonal  matrix  ot  weights,  i  e  wM  is  the  weight  lor  the  ith  location. 


W 


w,  ,  •••  0 


0  •••  w. 


NJf. 


(10) 


it  is  desired  to  derive  a  regression  fit  to  the  data  of  the  form: 

224 

v  "  E  ai S^’  k") 

<  •  o 


where  a,  are  regression  coefficients  and  S,  are  spectral  analysis  functions  of  latitude  and  longitude,  eg, 

F  <x»(2  lon)tin(lat) 


where  F  is  a  normalization  constant  different  for  each  spectral  function  To  solve  for  the  values  of  a, ,  we 
first  transform  the  location  matrix  L  into  a  matrix  S  that  gives  values  of  spectral  analysis  functions  at  each 
location. 
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where  m  is  the  number  of  spectral  analysis  functions,  or  225  in  this  case.  In  the  program  these  are  labeled 
0  to  224. 

The  mean  value  and  standard  deviation  of  each  spectral  function  is  calculated  using  the  value  of  the 
function  at  each  of  the  N  locations  and  weight  at  that  location.  II  the  locations  were  uniformly  spaced  over 
the  globe  and  the  weights  were  all  one,  then  the  mean  value  would  be  zero  1  Using  these  mean  values 
and  standard  deviations,  the  spectral  analysis  functions  are  normalized  by  subtracting  the  mean  and 
dividing  by  the  standard  deviation.  These  normalized  values  are  identified  by  use  of  a  caret  over  the 
variable  in  equations  14  and  15  below.  Thus  the  weighted  correlation  matrix  R 

r  =  5TwS  1 ,4) 

and  the  vector  &  of  normalized  weighted  regression  coefficients  is  found  using  (Tompkins  and  Wilson. 
1969): 

A  -  R  '$tWV  (l5) 

The  actual  inversion  is  accomplished  using  the  Cholesky  square  root  algorithm  (Westlake,  1968).  Then 
the  normalized  regression  coefficients,  A.  are  converted  to  ordinary  regression  coefficients  by  multiplying 


'This  is  one  major  difference  from  the  typical  use  of  spectral  functions  in  numerical  weather 
prediction 
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by  ihe  standard  deviation  of  v  and  dividing  by  the  standard  deviation  of  the  corresponding  spectral 
function.  Thus 


a 


i 


<IA) 


Hight  sets  of  coefficients  for  each  of  the  8  MAST's  for  each  season  for  the  3  parameters  mean  sk>  cover 
(S).  mean  correlation  (RO).  and  population  (P)  were  stored  on  permanent  files  with  names  like  NWMtX) 
that  stands  for  New  Winter  Mean  coefficients  for  00  MAST.  These  sets  of  coefficients  and  accompanying 
software  were  stored  on  tape  M 14223  for  backup  purposes  The  coefficients  were  written  out  using  the 
format  below: 

WKITH  (NTAPK.IOO)  I,  A(I) 

100  F0RMAT(16,EI5.7)  (4.1.6) 

where 

I  =  Index  (0  througn  224) 

A  =  Derived  coefficient 

4.1.7  Program  SHARMS 

Program  SHARMS,  not  shown  in  the  data  flow  configuration,  was  designed  to  predict  the  parameters  of 
mean  sky  cover,  mean  correlation,  and  population  for  any  given  location  on  the  globe  using  the 
coefficients  derived  from  program  WESPAN.  Predicted  mean  sky  covers  and  mean  correlations  were  in 
units  of  equivalent  normal  deviates  which  were  converted  to  values  between  0.0  and  1.0  by  routine 
PNORM  (Appendix  A).  The  program  predicted  a  selected  parameter  at  every  five  degrees  of  latitude  and 
longitude  through  use  of  equation  (11).  Each  predicted  value  was  stored  in  an  array  of  72  by  37  points. 
Bilinear  Interpolation  between  the  points  was  then  used  to  create  patterns  of  predicted  parameters  that  were 
mapped  onto  a  cylindrical  equal  area  projection  of  the  globe  from  90  deg  South  to  90  deg  North  The 
equal  area  projection  was  simply  generated  by  using  a  linear  scale  in  longitude.  Latitude  was  scaled  as 
sine  of  the  latitude.  Resulting  maps  were  used  for  quality  control  of  the  data  processing  effort  throughout 
the  entire  project. 


Figure  4  depicts  the  global  patterns  of  percent  mean  sky  cover  as  predicted  from  the  sky  cover  coefficients 
denved  from  DOE/Burger  data  for  winter  at  (X)  MAST  Figure  5  shows  the  corresponding  mean 
correlation  patterns  as  predicted  from  coefficients  of  mean  correlation  for  the  same  time  period. 

Figure  ft  shows  patterns  of  predicted  percentages  of  population  values  for  the  winter  case  at  (X)  MAST. 
The  approximate  magnitude  of  the  population  for  a  given  panern  can  be  computed  by 

N  =  (P+5*Vl(X)  x  PMAX  (17) 


where 


P  =  Percent  of  pattern  category  in  Figure  ft 
PMAX  =  990  (Upper  limit  of  sample  size  chosen  for  winter  cases) 

N  =  Approximate  sample  size  (Days) 

4.2  PROCESSING  INTERANNUAL  VARIANCE  OF  MEAN  SKY  COVER 

Interannual  standard  deviations  of  mean  sky  covers  were  computed  from  1 1  years  of  DOB  monthly  mean 
sky  covers.  Coefficients  of  the  standard  deviations  were  generated  in  order  to  subsequently  compute  and 
display  the  amount  of  expected  error  associated  with  predicted  sky  cover  amounts  and  other  related 
statistics  within  the  C  Cloud  S  program 

4.2.1  Program  FETC2 

As  shown  in  Figure  7,  a  return  to  the  DOF.  cloud  data  tape  M5550  was  necessary  in  order  to  extract 
monthly  mean  sky  covers  for  computing  interannual  variances,  Beginning  with  map  group  number  100, 
program  FETC2  invoked  by  XEQFH2.  extracted  from  file  number  2  the  monthly  mean  sky  cover  over 
land  areas  A  second  run  was  made  to  extract  the  same  parameter  for  the  grid  points  over  the  ocean  areas. 
These  were  found  in  file  6  stalling  with  map  group  number  1 340  The  extracted  data  were  labeled  on  disk 
tiles  with  names  such  as  JALI02  iJAnuary  Land  data  from  group  102). 
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Figure  4.  Pattern  Map  of  Predicted  Percent  Mean  Sky  Cover  for  the  Winter  Season  at  00  MAST.  Predictors  Were  Derived 
from  the  DOE/NCAR  and  Burger  Sky  Cover  Data  Ensembles. 
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Figure  5.  Pattern  Map  of  Predicted  Mean  Correlation  for  the  Winter  Season  at  »K)  MAST.  Predictors  Were  Derived 
from  the  DOE/NCAR  and  Burger  Sky  Cover  Data  Ensembles. 
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FiRiiro  6.  Pattern  Map  of  Predicted  Population  Percentages  for  the  Winter  Season  at  IH>  MAST.  Predictors  Were  Derivet 
from  the  DOF/NCAR  and  Iturger  Sky  Cover  Data  Ensembles. 
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Figure  7.  Data  Flow  Configuration  Used  in  Processing  the  Ihterannual  Standard  Deviations  of  Mean  Sky  Cover  Observations 
from  DOE  Data. 


4.2.2  Program  ANULAN,  AUNOCE 


Program  ANUl.AN  read  Ihe  monthly  mean  sky  cover  values  extracted  Iron)  FHTC2  and  summed  them 
tor  each  arid  box.  The  sum  of  the  squares  of  the  sky  cover  amounts  were  also  computed.  At  the 
completion  of  the  process  the  standard  deviation  of  mean  sky  cover  was  computed  for  each  grid  box  using 
the  basic  formula  below, 


(IS) 


where 


m  =  monthly  mean  sky  cover  amount 
n  =  number  of  observations 
s  =  positive  standard  deviation. 

A  date  time  group  DTG.  latitude-longitude  reference,  and  positive  standard  deviation  values  were  stored 
as  files  JALAND  or  FF.LAND.  etc.,  meaning  JAnuary  or  FEbruary  LAND  standard  deviation  values. 
Program  ANUOCE  handled  the  ocean  data  in  the  same  manner.  Names  like  JAOCEA  meaning  JAnuary 
OCEAN  values  were  assigned  to  these  data  files.  At  the  completion  of  this  process  the  land  data  were 
merged  with  the  ocean  data  and  the  land  data  files  purged. 

4.2.3  Program  WESPAN 

Finally,  program  WESPAN  read  the  intcrannual  standard  deviations  together  with  locators  and  compressed 
the  data  into  coefficients  which  were  stored  on  files  called  DIA201.  DIA202,  etc  ,  meaning  DOF, 
Inter  Annual  standard  deviations,  version  2  for  January,  01,  or  February,  02,  etc.  Twee  rets  of  coefficients 
were  generated,  one  set  for  each  month.  Tape  M 14223  was  utilized  to  retain  the  coc.'Fcients  and  the 
software. 

Figure  8  shows  predicted  patterns  of  interannual  standard  deviations  of  mean  sky  covers  derived  from  1 1 
DOE  January  monthly  sky  cover  means 
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Figure  8.  Pattern  Map  Showing  Predicted  Percent  Interannual  Standard  Deviations  of  Mean  Sky  Covers.  Predicted  Coefficients 
Were  Derived  from  DOE  January  Monthly  Mean  Sky  Cover  Data. 


5.  NIMBUS-7  CLOUD  DATA 


The  NASA  NIMBUS-7  cloud  database  was  generated  from  the  Temperature  Humidity  Infrared  Radiometer 
(THIR)  "1 1.5-mieron"  radiances  together  with  the  Total  Ozone  Mapping  Spectrometer  (TOMS)-derived 
"ultraviolet''  reflectivities,  climatological  temperature  lapse  rates  and  concurrent  surface  temperature  and 
snow-ice  information  from  the  Air  Force  three-dimensional  nephanalysis  archive.  (Hwang,  et  al.  lyssh) 

According  to  Hwang,  et  al.  (1988a),  the  primary  merits  of  the  NIMBUS-7  Cmatrix  Cloud  dataset  are  I) 
observed  radiances  used  for  this  cloud  dataset  were  obtained  from  the  same  instruments  used  over  six 
continuous  years  between  1979-1985,  so  it  represents  the  most  homogeneous  satellite  derived  cloud 
dataset  available;  2)  the  cloud  dataset  is  efficiently  retrieved  and  stored,  so  that  the  analysis  of  the  global 
cloud  distribution  over  climatological  time  scales  is  feasible;  and,  3)  daily  as  well  as  monthly  averages 
and  variances  of  cloud  data  are  presented  together  with  a  correlation  coefficient  surface  temperature 
archive.  Because  of  these  merits,  this  cloud  database  was  acquired  to  further  augment  cloud  statistics 
within  the  development  of  the  C  Cloud  S  program. 

The  NIMBUS-7  Cmatrix  cloud  data  were  received  on  seven  magnetic  tapes.  The  data  on  these  tapes 
consisted  of  more  than  100  parameters  that  were  archived  twice  a  day  at  noon  and  nrud.ughi  Among 
parameters  archived  are  daily  means  of  the  percentages  of  cloud  cover,  the  percentage  of  total  low .  middle, 
and  high  cloud  layer  amounts,  and  clear  air  radiances,  and  their  spatial  and  temporal  variances.  These 
parameters,  identified  by  number,  are  computed  for  2070  NIMBUS-7  earth  radiation  budget  (ERB)  target 
areas.  Each  target  represents  an  area  of  approximately  500  x  500  kms.  A  typical  spaual  distribuuon  ot  data 
within  the  2070  ERB  target  areas  can  be  seen  in  Figure  9.  Every  other  south-to -north  satellite  pass  is 
purposely  left  blank  to  enhance  the  actual  data  positioning  within  each  swath.  The  numbers  in  the  figure 
represent  Greenwich  Mean  Time  (GMT)  for  the  data  within  each  ERB  target  for  an  orbital  ascending  node 
at  noon  time. 

5.1  PROCESSING  NIMBUS-7  DATA 

The  major  portion  of  the  data  extracted  from  this  database  was  composed  of  daily  mean  cloud  covers, 
variances,  and  populations  within  each  ERB  target  for  each  of  twelve  months.  Figure  10  illustrates  the 
flow  of  events  used  in  the  processing  of  the  NIMBUS-7  Cmatrix  Cloud  data 
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A  Track  Map  of  NIMBUS-7  CMATRIX  Data  Dissemination  Over  2,070  ERB  Target  Areas.  Every  other  Orbital  Swath  is 
Purposely  l>eft  Blank  to  Show  the  Orientation  of  South  to  North  Noon  Time  Satellite  Coverage.  Numbers  in  the  Figure 
Represent  GMT’s  of  Data  Observations. 
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Figure  10.  Data  Flow  Configuration  Used  in  Processing  the  NIMBUS-7  Cloud  Cover  Climatology. 


5.1.1  Program  EXTRAK 

Procedure  XEQFF.T  calls  on  program  HXTKAK  to  extract  Irom  a  given  C'matrix  data  tape'  the  daily  maps 
of  parameters  selected  by  the  parameter  number  defined  in  the  Cmatrix  tape  user  documentation  by 
Wellemevcr  (1986).  A  list  of  the  tape  numbers  assigned  to  the  acquired  Cmatrix  data  tapes  and  then 
archived  period  of  record  are  listed  in  Table  3.  Table  4  lists  the  parameter  definitions  of  the  parameter 
number  codes  selected  lor  data  extraction.  The  extracted  daily  parameter  maps  for  all  months  and  irom 
each  Cmatnx  tape  are  stored  on  a  second  set  of  tapes  for  further  processing. 

Table  3.  CMatrix  Tape  Numbers  and  Archived  Period  of  Record 


Tape  Number 

Period  ot  Record 

M8197 

Apr  79-Oct  79 

M8196 

Nov  79-Od  80 

M8199 

Nov80-Od81 

M8202 

Nov  81 -Oct  82 

M6204 

Nov  82-00  83 

M8407 

Nov  83-00  84 

M8349 

Nov  84-Mar  85 

Table  4.  Parameter  Definition  Given  Selected  CMatrix  Parameter  Number  Code 


Number  Code 

Definition 

1 

Recommended  total  peroent  doudress  ascending 

2 

RMS  of  recommended  total  percent  cloudiness  ascending 

23 

Mean  subtarget  area  population  ascending 

25 

GMT  (mrutas)  ascending 

55 

Total  percent  cloudiness  descending 

56 

RMS  of  total  peroent  cloudiness  descendng 

75 

Mean  suntarget  area  population  descending 

77 

GMT  (minutes)  descending 

5.1.2  Program  SATSUM 


Program  SATSUM  is  utilized  to  sum  the  daily  map  data  stored  on  the  new  tapes  for  a  selected  month  and 
parameter  code  The  program  uses  a  navigation  table  called  TABDA,  published  by  Wellemeycr  ( 198(0. 
to  navigate  the  2070  NIMBUS-7  HRB  target  areas.  Daily  sums  of  4  parameters  over  individual  months 
for  a  period  of  record  of  six  years  were  stored  in  binary  form  on  files  named  J  ANSA  I  and  JANSA2, 
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JANSA6  meaning  January  Sums  for  years  1.2.  .  ft.  The  four  parameters  summed  aie  I)  mean  cloud 
cover,  2)  mean  cloud  cover  squared,  3)  GMT’s  and  4)  number  ol  observations. 

5.1.3  Program  NORSAT 

Program  NORSAT  reads  in  the  binary  data  summation  files  from  SATSUM  and  computes  the  mean  and 
standard  deviation  of  cloud  cover  in  each  HRB  target  area  for  both  ascending  and  descending  nodes  using 
equation  18  where 

m  =  sum  of  satellite  mean  cloud  cover 
n  =  satellite  population  count 
s  =  positive  standard  deviation 

Given  the  above  information,  mean  correlation  is  also  computed  using  the  methods  in  4.1.2.  A  date  time 
group,  latitude-longitude  locator,  mean  cloud  cover,  standard  deviation,  mean  correlation,  and  population 
for  each  ERB  target  area  are  then  written  to  permanent  files  using  the  standard  format  3.1.1.  Names  of 
the  files  written  were  SJANA,  SFEBA.  ...  SDECA  for  monthly  ascending  node  data  and  SJAND  for 
descending  node  data. 

5.1.4  Program  WESPAN 

The  files  created  by  program  NORSAT  above  are  transported  to  the  UNIX  system  where  program 
WESPAN  is  utilized  for  compressing  the  satellite  data  into  coefficients  Tho  coefficients  generated  are 
for  mean  cloud  cover  and  mean  correlation  at  00  and  12  MAST.  Coefficients  are  not  generated  for  the 
satellite  population  values  since  the  sample  sizes  within  all  ERB  (ai  gels  are  reasonably  constant  at  about 
186  or  31  days  x  6  years. 

Files  containing  the  satellite  coefficients  are  named  JANMA2  and  JANRA2  meaning  January  moan  cloud 
cover  (M)  and  January  mean  correlation  (R)  for  ascending  (A)  node,  version  2,  and  JANMD2  or  JANRD2 
for  the  descending  node.  The  January  file  is  followed  by  files  lor  FHH.  MAR.  etc 
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Near  (he  completion  of  each  WESPAN  computer  run,  a  multiple  correlation  coolTicieiu  is  computed  and 
primed  to  examine  how  well  (he  generated  coefficients  lit  the  data.  The  multiple  correlations  are  tahulaied 
for  each  run  They  are  presented  in  Table  5  for  NIMBUS-'?,  DOE,  and  DOE  interannual  standard 
deviations. 


5.1.5  Program  SATTIM 


The  NIMBUS-7  satellite  cloud  database  offers  excellent  spatial  coverage  of  cloud  amount  statistics  over 
the  globe  It  gives  good  estimates  of  cloud  coverage  over  areas  where  surface  observations  are  sparse  or 
completely  void.  However,  the  data  arc  collected  only  twice  a  day,  noon  and  nudmght.  Therefore,  we 
developed  program  SATTIM  to  interpolate  the  spatial  coefficients  derived  Irom  the  satellite  data  lor 
predicting  cloud  amounts  at  00  and  12  MAST  to  eight  times  of  day  (00,  03,  06,  ...  21  MAST)  To 
accomplish  the  mierpolation  tor  each  coefficient  A.  we  use  a  first  order  Fourier,  equation  19,  with  a 
temporal  phase  adjustment  of  2  hours,  and  an  amplitude  adjustment,  of  the  amount  computed  in  equation 
20  below 


4(0 


(a,  +h(at-b)coa 


2n(t-2) 
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(19) 


where 


006(30°) 


(20) 


t  =  M AST(00, 03,06,.  .  or  21) 

a,  =  00  MAST  cloud  cover  coefficients 

b,  =  1 2  MAST  cloud  cover  coefficients 
A,  =  coefficient  for  mean  cloud  cover 

i  =  Index  0  through  224. 
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Table  S.  Multiple  Correlation  Coefficients' 
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'l.mled  11c  multiple  correlation  coefficients  computed  hy  program  WI.SPAN  for  monthly  NIMIUIS  7  mean  IM)  cloud  covers  and  mean  correlations  {R>  denved  from  data  ovci 
ascending  nodes  (A)  ami  descending  nodes  (l)(  Multiple  correlation  coefficients  are  also  lislcd  under  mid  season  months  that  were  derived  for  live  eight  (‘.M  l  scas.vi.il  |H)I  mean 
sky  coven  (Ml.  mean  correlations  (R).  and  populations  (P)  Finally,  multiple  correlation  coefficients  derived  fiom  IMil.  monthly  iitlerannu.il  standard  deviations  arc  lislcd  in  the 
row  lahclcd  IA 


Hath  interpolated  set  of  225  coefficients  tor  predicting  mean  cloud  coverage  at  3  hourly  time  intervals  is 
stored  on  disk  files  with  names  like  SJANOO  or  SFHIJI2,  meaning  Satellite  JAnuary  00  or  FHbmary  12 
MAST.  The  resulting  new  ensemble  of  coefficients  was  then  in  a  form  suitable  for  the  blending  of  satellite 
mean  cloud  cover  predictors  with  ground  based  mean  sky  cover  predictors  front  Burger  and  1X)K  data, 
(see  Section  7) 

5.1.6  Program  SHARMS 

Program  SHARMS  used  die  coefficients  generated  by  WESPAN  to  produce  global  pattern  maps  of 
predicted  satellite  observed  mean  cloud  covers  and  mean  correlations  for  data  quality  control  An  example 
of  die  predicted  global  mean  cloud  cover  for  January  at  00  MAST  is  displayed  in  Figure  1 1  Mean 
correlation  for  January  at  (X)  MAST  is  shown  in  Figure  12  Tape  MI4224  was  used  to  hack  up  all 
software  and  files  generated  during  the  satellite  data  processing  effort. 

6.  DATSAVDATA 

This  dataset  contains  up  to  1 1  years  of  hourly  surface  observations  taken  at  individual  stations  around  the 
world  The  data  have  been  formatted  and  archived  on  magnetic  tape  by  the  U  S.  Air  Force  Environmental 
Technical  Applications  Center  (USAFETAC).  This  archive  of  surface  observations  is  called  DATSAV 
The  archived  hourly  data  consisls  or  weather  conditions  such  as  temperature,  winds,  dew  points,  total  sky 
covers,  cloud  types,  etc.  Documentation  of  the  DATSAV  database  can  be  found  in  the  DATSAV 
Reference  Manual,  1977  A  more  recent  updated  document  is  the  DATSAV2  Reference  Manual,  1986. 

6.1  PROCESSING  DATSAV  DATA 

DATSAV  data  tapes  for  66  stations  positioned  in  clusters  at  various  locations  around  the  world  were 
successfully  processed  by  Willand  (1988)  in  order  to  archive  hourly  mean  sky  cover  and  scale  distance 
parameters.  Since  then  26  additional  stations  wiUmn  new  clusters  were  added  to  augment  tins  data 
collection.  These  new  samples  were  in  the  updated  DATSAV2  format.  Figure  13  depicts  ihe  general 
positions  of  all  Ihe  acquired  DATSAV  clusters.  Table  6  lists  names  and  locations  of  the  stations  within 
each  cluster  The  data  flow  for  processing  the  DATSAV  data  is  shown  in  Figure  14, 
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NIMBUS-7  CMATRIX  Cloud  Coxer  Climatology. 


Map  of  Cluster  Positions  for  DATSAV  and  DA  I 


Table  6.  Listing  of  DATSAV  Stations  Within  Clusters 


Sir 

Lat 

Lon. 

Elev. 

Name 

MIDWEST: 

72638 

44  37 

-84.68 

0351 

Houghton  Lake,  Ml 

72652 

43.07 

•95.53 

0423 

Pickslowne,  SO 

72424 

37.90 

-8597 

0233 

Ft.  Knox.  KY 

72SS1 

40.85 

•96.75 

0362 

Lincoln,  NE 

72640 

42  95 

•87.90 

0220 

Milwaukee 

72641 

4313 

•89.33 

0262 

Madison 

72637 

42.97 

•89  73 

0238 

Flint 

72552 

40.97 

■98  32 

0563 

Grand  Isle,  NE 

72556 

41  98 

•97.43 

0479 

Norfolk,  NE 

72423 

38  18 

•85  73 

0151 

Loueville.  KY 

72635 

42  88 

■8552 

0242 

Grand  Rapids.  Ml 

72636 

43.17 

-86.23 

0191 

Muskegon.  Ml 

72529 

4312 

-77.67 

0171 

Rochester,  NY 

72528 

42  93 

-78  73 

0217 

Buffalo.  NY 

72545 

41  88 

-91  70 

0263 

Cedar  Rapids,  IA 

72546 

4153 

•93.67 

0292 

Des  Moines,  IA 

72734 

46.47 

•8437 

0220 

Sail  St  Marie 

SOUTHWEST: 

725690 

42.55 

-106128 

Casper,  WY 

746120 

3541 

•11T41 

0681 

China  Lake,  CA 

723865 

36  25 

•115  03 

0570 

Nelts  AFB 

724660 

38  82 

•104  72 

1881 

Colorado  Springs,  CO 

722800 

32.40 

-114  36 

0063 

Yuma,  AZ 

726940 

44  92 

•123  02 

0061 

SalenVMcNary 

MIDDLE  EAST  1: 

400800 

33  31 

36  81 

Damascus,  Syria 

401840 

31  46 

35.14 

Jerusalem,  Israel 

623660 

30.00 

31  17 

Cairo,  Egypt 

MIDDLE  EAST  II: 

404160 

2618 

50  08 

Dharan,  S.  Arabia 

404380 

24  31 

46  47 

Ridyadh,  S.  Arabia 
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Table  6.  Listing  of  DATSAV  Stations  Within  Clusters  (Continued) 


Sti. 

Lat 

Lon 

El«v. 

Name 

MIDDLE  EAST  II: 

405820 

2904 

47  59 

Kuwait.  Kuwait 

406890 

30  30 

4747 

Basrah,  Iraq 

ASIA  1: 

471220 

37  11 

127  04 

0041 

Osan  A  B .  S  Korea 

470580 

39.03 

125.48 

Pyong  Kang,  N.  Korea 

ASIA  II: 

476420 

35  45 

139.21 

Yokota.  Japan 

478270 

31  24 

130  39 

Kagoshima.  Japan 

ASIA  III: 

464560 

13  50 

100  29 

0004 

Bangkok.  Thailand 

488200 

2104 

105  50 

Hanoi,  Vieinam 

488550 

1608 

108.22 

Da  Nang.  Vietnam 

489000 

1046 

106  34 

Ho  Chi  Mmh.  Vieinam 

983270 

1S.1I 

120  32 

Clark  AFB,  Phillipines 

466960 

Tapei,  Taiwan 

CENTRAL  AMERICA: 

786410 

14.37 

•90  32 

Guatemala  Cila.  Guatemala 

786630 

13.45 

-89.11 

San  Salvador,  El  Salvador 

787200 

1408 

•87.15 

Tegucigalpa.  Honduras 

787410 

12.10 

-86.16 

Managua.  Nicaragua 

787600 

10.00 

■85.00 

Funta  Arenas.  Costa  Rica 

783060 

892 

•79  60 

Howard  AFB,  Panama 

SOUTH  AMERICA: 

90370 

087 

•77  68 

2975 

Ipiales 

80110 

622 

-7560 

1498 

Medellin 

80089 

700 

•74  72 

0610 

Amalfi  Aniioquis 

80259 

355 

•7638 

0969 

Cafc'Parameseca 

80308 

2.45 

-7660 

1730 

Popa  Van/leon-Valei 
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Table  6.  Listing  of  DATSAV  Stations  Within  Clusters  (Continued) 


su. 

Lit 

Lon. 

E  lev. 

Name 

AMAZON  BASIN: 

82678 

-617 

-42.58 

Ftonano/Cangapara 

82562 

-5.21 

•4907 

Maraba 

HAWAII: 

S1 166 

21  15 

-15710 

0138 

Molokai 

91162 

21.35 

-157.93 

0005 

Honolulu 

SWEOEN-NORWAY: 

01006 

69  07 

15.18 

0014 

Anda 

01025 

6968 

1892 

0010 

Troniso 

01059 

7007 

24  98 

0008 

Banak 

01041 

6975 

21.03 

0004 

Nororeisa 

10570 

69  37 

24  43 

0285 

Cuoulatmauki 

10490 

69.98 

23.3S 

0003 

Luflhaun 

01098 

7037 

31.10 

0014 

Vafdo 

20960 

67.20 

2342 

0168 

FajaJa 

22101 

68  90 

28  45 

0118 

Virtanieaii 

28360 

67.37 

2665 

0178 

Sodankyta 

22106 

68.60 

31.60 

0034 

Padjn 

22119 

68  32 

3333 

0130 

Pulozem 

22204 

67  55 

3043 

0221 

Kofdof 

22205 

67  60 

31  17 

0160 

Ena 

22216 

67  55 

33  39 

0136 

Apatity 

22232 

67  70 

3712 

0244 

Kolmya/r 

22249 

6713 

3967 

0149 

Kanouka 

GERMANY: 

10147 

53  63 

998 

0016 

Hamburg 

67500 

4665 

862 

2287 

Guetsdi 

67620 

4617 

888 

0198 

lacamo 

67700 

46.00 

897 

0276 

Logiano 

102  35 

5300 

983 

0077 

Sotlau 

10803 

48  02 

785 

0300 

Freiburg 

10908 

47.87 

800 

1493 

Feld  berg 
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Table  6.  Listing  of  DATSAV  Stations  Within  Clusters  (Continued) 


Sta. 

Lai 

Lon. 

Elev  ’ 

Name 

GERMANY: 

16066 

45.62 

8.73 

0211 

Miland 

16072 

45  87 

907 

1319 

Monte  BiSino 

16060 

4543 

928 

0107 

Milano  linaic 

06700 

46  25 

6  13 

0436 

Geneu/Cointrin 

06670 

47.48 

8.53 

0432 

Zuritcb 

06680 

47  25 

9.35 

2500 

Gaenits 

06720 

46  22 

733 

0482 

Sion 

06760 

46  17 

8.78 

0379 

Locamo 

06610 

46  82 

695 

0489 

Payern 

06990 

47  13 

9.53 

0463 

Vaduz 

ALASKA: 

70174 

66  92 

-151.52 

0205 

Benles 

70063 

70  15 

■14320 

Prudhoe 

70265 

64  40 

•1477 

0128 

Eielson 

'Elev.  =  Meters 
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Figure  14.  Data  Flow  Configuration  for  Formatting  DATSAV  Data  for  Subsequent  Processing  Using  the  HMDP  Program 


6.1.1  Program  GETCOV 


The  How  of  data  processing  begins  with  the  newly  acquired  DATSAV2  data.  I’rocedurc  XHQGET 
positions  (he  DATSAV2  data  (ape  10  the  proper  1'ilc  so  that  program  (  iKTC'OV  can  extract  the  hourly  sky 
cover  observations  lrom  a  selected  station.  These  hourly  sky  cover  observations  1'rom  each  selected  station 
are  packed  and  stored  on  disk  as  binary  random  access  data  tiles  Detailed  packing  and  storage 
documentation  of  the  sky  cover  observations  can  be  found  in  Wiliam!  ( 19X8)  Alter  processing  the  newly 
acquired  DATSAV2  data  through  program  GETCOV,  backup  tape  M5693  containing  DATSAV  hourly 
sky  cover  observations  (processing  described  by  Willand  (1988))  are  spooled  hack  onto  disk  storage 
Hourly  sky  cover  data  for  all  the  selected  stations  in  each  cluster  are  then  ready  for  further  processing 

6.1.2  Program  FORDAT 


Program  FORDAT  was  written  to  format  DATSAV  sky  cover  data  into  an  acceptable  form  for  processing 
using  the  Biomedical  Data  Processing  Statistical  Software  Package  (BMDP)  (Dixon,  et  al.,  1988).  The 
BMDP  has  an  accurate  tctrachoric  correlation  package  which  is  useful  for  defining  spatial  and  temporal 
correlations  of  sky  cover  conditions  over  stations  within  the  DATSAV  clusters. 

Procedure  XEQFDA,  used  for  executing  program  FORDAT,  requires  a  FORTRAN  NAMELIST  input  file 
to  define  the  period  of  record  selected  for  formatting.  The  necessary  NAMELIST  arguments  are: 


SIDEF 

PRINT  =  TRUE., 
IYI  =  73. 

IY2  =  83, 

IMI  =  7. 

I  M2  =  7, 

NUMT  =  12. 


FYim  resulting  data  values 
First  year  of  data  extraction 
Second  year  of  data  extracuon 
First  month  of  data  extraction 
Last  month  of  data  extraction 
Pointer  to  output  file 


SEND 


The  format  used  by  program  FORDAT  to  write  (he  new  data  files  lor  BMDP  processing  is  as  follows 


43 


WRlTR(NTAPH.100)[STA,XLAT,XI.()N(i.lY,IM,I!).(U)ATA(l,U)),i=l.lTl.lT2) 

UX)  FORMAT(16.F6.2,F7.2, 312,2412) 
where 

ISTA  =  Station  number 

XI. AT  =  Station  latitude 

XI.ONG  =  Station  longitude 

IY  =  Year 

IM  =  Month 

ID  =  Day 

ITI  =  Start  time 

IT2  =  End  time 

The  tormatted  ASCII  data  Tiles  are  first  stored  on  the  CYBHR  system  and  then  transmitted  to  the  VAX 
system  which  processes  spaual  and  temporal  correlations  of  sky  cover  using  the  BMDP  package.  The 
resulting  spatial  and  temporal  correlations  are  then  represented  graphically  to  define  the  decay  ol 
conditionality  over  space  and  time  within  each  cluster.  Figure  15  shows  the  temporal  decay  of  sky  cover 
derived  from  16  stations  over  the  mid-western  United  States  for  January. 

7.  DATABASE  BLENDING 

At  this  point  of  the  data  processing  acuvities,  the  FINARC  portion  of  Burger  data  is  merged  with  the  DOH 
database  and  the  determination  of  the  coefficients  is  completed  The  coefficients  predict  mean  sky  covers, 
mean  correlations,  and  populations  for  up-looking  sky  cover  conditions  at  three  hourly  intervals  in  MAST 
tor  the  four  seasons  Also  completed  are  the  coefficients  for  predicting  mean  cloud  covers  and  mean 
correlations  from  down-looking  satellite  observations  for  the  same  three  hourly  intervals  but  for  each  of 
12  months  The  next  step  in  the  processing  converts  DOE  seasonal  sets  of  coefficients  to  monthly  sets  and 
in  the  process  blends  together  coefficients  for  predicting  ground  observed  mean  monthly  sky  covers  with 
those  for  predicting  satellite  observed  mean  monthly  cloud  covers.  The  flow  of  the  blending  process  is 
shown  in  Figure  16  and  details  of  the  processing  are  discussed  below. 
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Correlalions  of  Sky  Cover  for  the  Midwestern  ll.S.  Cluster. 
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NIMHl  S-7 


7.1  PROGRAM  BLEND 


Iniiiully,  program  BLEND  reads  in  all  sols  of  coefficients  needed  lo  achieve  ihe  blending  process  'Hus 
involves  reading  and  storing  into  the  program  the  coefficients  lor  DOF  mean  sky  covers  -  Si.  mean 
correlations  (  RO)  and  populations  (P)  lor  all  lour  seasons  at  all  eight  3  hourly  MAST's  Also  read  and 
stored  are  all  the  coefficients  lor  satellite  mean  cloud  covers  (  C)  adjusted  for  all  8  times  of  day  lor  each 
of  the  12  months,  (Section  5. 1.5,  equation  19)  The  following  computations  are  then  performed  on  ul!  the 
incoming  coefficients. 

First,  coefficients  derived  from  DOE  and  Burger  climatologies  for  predicting  mean  sky  cover,  mean 
correlation,  and  population  by  seasons  are  converted  to  coefficients  capable  of  predicting  the  three 
parameters  by  months.  This  is  accomplished  through  use  of  equations  21  and  22 

2n(Af-l)  r]) 

12 

/ 

and  / 

(WrGt+St+Fy(WrG+SrF)ax(2x)  (GrF^smM*(WrS,) cos(x)  ,,,, 

4  *  2 


where 

M  =  Month  (1-12) 

W  =  Coefficient  for  Winter 
G  =  Coefficient  for  Spring 
S  =  Coelficient  for  Summer 
F  =  Coefficient  for  Fall 
s  =  Coefficient  for  month  M. 
i  =  Index  0  through  224. 

Each  coefficient  for  predicting  monthly  mean  sky  cover  is  then  blended  with  (he  corresponding  coelficient 
lor  predicting  monthly  mean  cloud  cover  from  satellite  climatologies  using  equation  2?. 
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(23) 


where 

s,  =  Computed  monthly  DOE  coefficient  lor  mean  sky  cover  (equation  22). 

A,  =  Corresponding  satellite  coefficient  for  mean  cloud  cover  (equation  19). 
w,  =  .6470588 
w2  =  .3529412 
a,  =  Blended  coefficient 
i  =  Index  0  through  224. 

We  derived  the  weights  (w,  and  w,)  in  equation  2?  by  setting  a  maximum  sample  size  for  a  DOE  data 
sample  to  be  341  days  (31  days  x  1 1  years)  and  a  maximum  satellite  data  sample  size  to  he  186  days  (31 
days  x  6  years).  Total  sample  size  is  then  527  days  (186  ■*-  341).  Thus,  341/527  equals  .6470588,  which 
is  the  weight  w,  that  is  applied  to  each  DOE  monthly  mean  sky  cover  predictor  S,.  The  weight  w2  applied 
to  each  coefficient  to  predict  satellite  monthly  mean  cloud  cover  A,  is  (1.0-w,)  or  .3529412. 

Mean  correlations  derived  from  satellite  observed  cloud  amounts  do  not  correlate  well  with  those  derived 
from  ground  observed  sky  covers.  Therefore,  coefficients  for  predicting  monthly  mean  sky  dome  mean 
correlations  arc  simply  the  results  of  equation  22. 

Coefficients  computed  from  equation  22  for  predicting  population  values  were  blended  with  the  satellite 
population  predictors  by  simply  adding  the  ratio  of  18ft  days/341  days  or  .545454  to  the  first  coefficient 

Figure  17  depicts  patterns  of  percent  sky  cover  predicted  from  the  blended  coefficients  for  January  at  00 
MAST. 


Note  that  coefficients  for  predicting  mean  correlauon  are  not  blended  or  weighted  but  remains  as 
coefficients  computed  directly  irom  equations  21  and  22. 

Final  coefficients  are  written  to  permanent  storage  with  names  such  as  M30KX),  meaning  coefficients  lor 
predicting  monthly  mean  sky  covers  M  version  number  3  for  January  01  at  (X)  MAST.  R  and  P  aie  used 
as  prefixes  to  the  names  of  monthly  mean  correlation  and  population  coefficients,  respectively.  Backup 
tape  Ml 4224  is  used  (o  store  program  BLEND  and  all  die  blended  coefficients. 
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Figure  17.  Pattern  Map  of  Predicted  Percent  Mean  Sky  Cover  fur  January  at  00  MAST.  Predictors  Were  Derived  from  Blending  tlie 
Coefficients  for  Predicting  Sky  Cover  (DOF/Burger  Data)  With  Those  for  Predicting  Cloud  Cover  (NIMBUS-7  data). 


7.2  PROGRAM  NFT 


NR'  is  a  network  file  transfer  utility  that  is  utilized  to  transler  final  coeiiicients  from  the  mainframe  to 
(loppy  disk  storage.  The  data  stored  on  these  (loppy's  are  m  ASCII  code  Their  lormat  is  described  in 
format  4. 1.6. 


Specifically,  final  version  3  of  blended  coefficients  for  predicting  mean  sky  covers  M.  and  the  coefficients 
for  mean  correlations  R,  and  populations  P,  are  organized  on  floppy  disks  by  month  and  time  in  the 
following  manner. 


Disk  1  M 30 1 00,03... 21  Disk  2  M30300.03...21  Disk  3  M 30500.03... 2 1 


R30I00.03...21 
P30I00.03...21 
M  30200, 03. ..21 
R30200.03...21 
P30200.03...21 


R  30300,03. .21 
P30300.03...2 1 
M30400.03...2I 
R  30400, 03... 21 
P304(K).03 .21 


R  30500.03.. .21 
P30500.03...21 
M  30600, 03. 2 1 
R  30600,03... 2 1 
1’306(X),()3.  21 


Disk  4  M 30700.03..  . 21  Disk  5  M 30900, 03.. .2 1  Disk  6  M3 1 100,03  . .2 1 


R30700.03...21 
P30700.03...21 
M30800.03...21 
R  30800,03... 21 
P30800.03...2I 


R30900.03...21 
P30900.03...21 
M3 1000,03. .21 
R3 1000,03.  .21 
P3 1000, 03... 21 


R31 100,03  .21 
P3I100.03...2I 
M3 1 200.03..  . 21 
K3 1 200,03  .21 
P3 1 200,03  .21 


Coefficients  for  predicting  DOF.  (D)  interannual  (I A)  standard  deviations  (version  2)  of  sky  cover  are 
stored  on  a  single  floppy  disk  number  7  for  each  month  01  through  12  as 


Disk  7  D1A201  DIA202  DIA203  D1A204  DIA205  DIA206 
DIA207  DIA208  DIA209  DJA2I0  DIA2II  DIA2I2 


Finally,  the  coefficients  for  predicting  satellite  mean  correlation  are  stored  on  a  single  floppy  disk  number 
K  in  the  Pillowing  order.  APRRA2  represents  month  of  April,  R  is  mean  correlation.  A  is  ascending  node, 
and  I)  is  descending  node  for  version  2. 
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Disk  8  APRRA2 
JUNRA2 
APRRD2 
JUNRD2 


AUC1RA2  DKCUA2  KKHRA2  JANRA2 
MARRA2  MAYRA2  N0VRA2  0CTRA2 
AUGRD2  DHCRD2  KKBRD2  JANRD2 
MARRD2  MAYRD2  NOVRD2  OCTRD2 


JUI.RA2 

SHPRA2 

JUI.RD2 

SKPRD2 


7.3  PROGRAM  DODAT 


Program  DODAT  was  written  ir,  Quick  BASIC  on  a  Zenith  248  PC  to  convert  ASCII  files  of  C  Cloud 
S  coefficients  listed  above  to  Quick  Basic  BLOAD  binary  files.  The  program  uses  the  actual  ASCI!  names 
of  the  files  from  floppy  disks  to  navigate  the  coefficients  into  the  proper  bins  Therefore,  if  new  tile  names 
for  coefficients  on  floppies  are  changed  for  any  reason,  the  file  names  in  DODAT  must  also  he  changed 
and  the  program  recompiled.  DODAT  compresses  each  incoming  coefficient  in  1-115.7  format  io  integer 
form  by  multiplying  it  by  10000.0  and  then  storing  the  result  as  a  16  bii  binary  integer.  (Interannual 
standard  deviation  coefficients  are  multiplied  by  10000  be: ore  conversion  to  integer  form.)  Upon  program 
completion,  a  list  of  maximum  absolute  values  of  coefficients  encountered  for  each  parameter  is  made 
available.  This  list  was  checked  to  insure  that  none  of  the  integerized  values  exceeded  2**1 3ih  The  new 
compact  coefficients  are  then  suitably  arranged  for  storage  in  the  C  Cloud  S  program  as  BLOAD  files  and 
could  also  be  stored  on  a  single  floppy  disk  (360  version)  along  with  other  files. 
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FUNCTION  TETRA( A,B,C, D) 

t******ti****ii***ii*ttt***********ii*it*****tt*tt**** 

ORIGINAL  TETRA  FUNCTION  WAS  WRITTEN  IN  BASIC 
BY  A.  BOEHM  OF  ST  SYSTEMS.  THIS  FORTRAN  VERSION 
WAS  ADAPTED  FOR  THE  CDC  CYBER  60  BIT  COMPUTER 
BY  J.  H.  WILLAND,  ST  SYSTEMS,  ON  01/03/R9. 

PARAMETERS  A,  B,  C,  D  MAY  BE  EITHER  NORMALIZED  ELEMENTS 

OR  ACTUAL  COUNTS. 

RETURN  THE  TETRACHORIC  CORRELATION. 

REF.  BOEHM  1976:  TRANSNORMALI ZED  REGRESSION  PROBABILITY, 
AWSTR-75-259,  P24. 


DATA  PI/3.1415926536/ 


Note  that  -999.0  is  returned  for  robar  if  no  solution. 

R— 999.0 
AD-SQRT(A*D) 

BC-S0RT(B*C) 

I F ( AD . LE .0.0. OR .BC.LE.0.0)GO  TO  99 

-  FIRST  GUESS  - 

-  PANOFSKY  AND  BRIER,  SOME  APPLICATIONS  OF 

STATISTICS  TO  METEOROLOGY,  1965,  PP.  103-104. 

RSIN-SIN(PI/2. *((AD-BC)/<AD+BC) )) 

R-RSIN 
XN-A+B+C+D 
P1-(A+C)/XN 
HO— ENORM(Pl) 

P2-(A+B)/XN 
OK— ENORM(P2) 

AN-A/XN 
Rl-0. 

D1-P1*P2-AN 

c  -  ITERATIVE  SOLUTION  - 

DO  10  K-1,25 

DO-FLNORM( HO , OK , R ) -AN 

IF(  ABS(  DO )  .  LT  .-1 . 0E-7  )GO  TO  99 

RS-R 

IF(ABS(D0-D1) ,LT.1.0E-7)GOTO  99 
R~R-D0* ( Rl-R )/( DI-DO ) 

DI-DO 

Rl-RS 

c  Next  line  was  corrected  12/2/92.  (l.,R)  was  (R,R).  J.  w. 

IF(ABS(R) .GE. 1.0)R-(SIGN{ 1. ,R)+RS)/2. 

10  CONTINUE 
99  TETRA-R 
RETURN 
END 


FUNCTION  TFUNC2 ( A, B, C) 

C 

C  AVOIDS  DIVISION  BY  ZERO. 

C  . . . 

IF(ABS(C) . LT. 1 . OE-0 )THEN 

Y“SIGN(B,B)*(1. 0-PN0RM( ABS ( A ) ) )/2. 
TFUNC2-Y 
ELSE 

TFUNC  2-TFUNC ( A , B/C ) 

ENDIF 

RETURN 

END 
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FUNCTION  TFUNC(IIO,A) 
DATA  PI/3  .  J  4  15926536/ 


0  ********* . ************ 

O  B I  NORMAL  SECTOR  ]  NTK.GRA1 .. 

C  REF.  YAMAUT! , (  1  972)  :  STAT .  TARDUS  AND  FORMUI.AS  WITH  COMPUTER 

C  A  PP  1. 1  CAT  IONS  ,  .1 AP  STAT.  ASSC.  , PROGRAM  IS 

C  ****** . ********** 

IF<ARS(IIQ)  .  I.F,.  1 .0E-50J0O  TO  70 


A1 ~ABS( A) 
lll-IIQ 

IF(Al.LT.l.O) 00  TO  9 
111-A.lMIQ 
Al-1 ./Al 
9  AA-A 1  * A l 
1111-111*111/2. 
mil-ALOG(HII) 

SUM2-EXP  ( -Till ) 

SUM1-1 .  -SUMP. 

FACT-0 . 0 
S— AA 

DO  10  J-l , 80 
X-J 

FACT- FACT- At  ,OG(  X  )  t  Dill 
SUM2-SUM2  +  FXP (  1110  FACT) 

C-S*  (  1  .  -SUM2)/(  2.  ‘XU  .  ) 

SUM  l -SUM  HO 

IF(A8S<C) . LT. I . OE-10)GO  TO  11 
S— S‘AA 

10  CONTINUE 
PRINT  100, 110.  A 

100  FORMAT  (  "  TFIJUC  1 10  A"  ,  2F10 . 1  .  1  X ,  "DID  NOT  CONVERGE.") 

11  T- ( ATAN  (  A 1  ) -SUM 1*A1)/(2.*PI) 

IF( ABS ( A ) . LE. 1 . 0)GO  TO  19 

AA-PNORM(-IIQ) 

I1II-PN0RM(  -III  ) 

T-{  AA-MIH)  *  ,5-AAMIII-T 

19  IF( A . LT .0.0 )T'  -T 
TFUNC-T 

GO  TO  99 

20  TFUNC-ATAN (  A  )  /  (  2  .  *  P I  ) 

99  RETURN 

END 

FUNCTION  FLNORM (  U ,  V ,  R  ) 

C  ******* . *************** . ************** 

C  COMPUTE  THE  INTEGRAL  OF  THE  BIVARIATE  NORMAL 

C  DISTRIBUTION  UP  TO  LIMITS  U  AND  V, 

C  REF  OWEN, 1980:  A  TABLE  OF  NORMAL  INTEGRALS, 

C  COMMON  STATISTICS. -SIMUI^-COMPUTA. , 

C  B9(4),  389-119,  EQ  3.1. 

DATA  PI/3 . 14  I S9 26536/ 

c  - 

IF(ABS(U)  .  LT.  I.OE-8  .AND.  ABS  (  V  )  .  I,T .  1 .  OE- 8  )GO  TO  999 

II--U 

XK--V 

Y-  (  PNORM(  II )  +  PNORM(  XK  )  )/2. 

1F( U . LT . 0 . . AND. XK ,GE. 0 . )Y-Y-.5 
IF(XK.LT.O. . AND.H.GE.O. )Y-Y-. 5 
Y-Y-TFIJNC2 ( II ,  XK-R‘II,  l(*SORT(  1 . -R*R) )- 
X  TFUNC2(XK,II-R*XK,XK*S0RT(  1  .  -  R  *  R  )  ) 

FLNORM- Y 
RETURN 

999  FLNORM- . 25*  AS  I N( R ) /( 2 . • PI ) 

RETURN 

END 
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FUNCTION  PNORM(E) 


O  . . . . 

C  END  TO  PROBABILITY 

C  ADAPTED  FOR  TliE  CDC  60  niT  COMPUTER  DY  J.  W7LI.AND  (STX) 

C  CUMULATIVE  NORMAL. 

C  INTEGRAL  OF  THE  STANDARD  NORMAL  DISTR  I  niJTION 

C  FROM  INFINITY  TO  E. 

C  REF.  AORAMOWIT7,  AND  STEGUN  (196-1);  HANDBOOK  OF 
C  MATH.  FUNCTIONS,  NHS,  EQ.  26.2. 19. 

C  . . . 

A-ABS(E) 

A- ( ( ( ( ( (5. 621326E-G*A<  5. I 05517E-5 ) *A^ 3  . 9686I6E-5) ‘A*  3  .  -I  227  3  9E- 3  ) 
X*A»  2. 2077E-2)»A<-5. 207  516E-2)  *At  1  .  M  .  -1  27  370E -  2  ) 

A-A*  *(—16) 

IF(E.GT.0.0)A-1.-A 
PNORM-A 
RETURN 
END 

FUNCTION  ENORM(P) 

******»************•»**»• 

PROBABILITY  TO  EQUIVALENT  NORMAL  DEVIATE  (END) 

ADAPTED  TO  THE  CDC  60  BIT  COMPUTER  BY  J.  WILLAND  (STX) 

ALGORITHM  AS111,  APP  STAT  26  1977  PI 18. 

INVERSE  OF  STANDARD  NORMAL  DISTRIBUTION 
INTEGRATION  FOUND  IN  PNORM. 

t*****ttt**tt**ttt*tt*t» 

Q-P-  .  5 

IF( ABS(Q) . GT. 0. 4  2 )GOTO  10 
R-Q*Q 

R-Q*  ( ( (-25.1'1I06*R+-H.3912)*R~10.615)*R*2. 506620  )/ 

X  (  (  <  (3.  130829*  R-21. 0622-1  )‘Rf  23. 00337  )*R~0. -173511  )*R  +  1.  ) 

ENORM-R 
RETURN 

10  IF(Q.  LE .  0 . 0  )TIIEN 
R-P 
ELSE 
R~l.-P 
END  IF 

IF(R.LT. 1.0E-500)R-1.0E-500 
R-SORT(-ALOG(R) ) 

R-(  (  (2.321213*RM .  8501-1 1 )  *R- 2  .  297965)  *R-2. 707189)/ 

X  (<  1.637068*Rt-3. 5-13009)  *RO  .  ) 

IF( Q . LT . 0 . 0 ) THEN 
ENORM— R 
ELSE 

ENORM-R 
END  IF 
RETURN 
END 
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SUBROUTINE  SKYRO< R, RBAR) 
*********************************** 


MEAN  CORRELATION  GIVEN  SCALE  DIST.  BY  A.  BOEHM  AND  J.  WILLAND 
ST  SYSTEMS  109  MASS.AV,  LEXINGTON  MA. 

VERSION  3.0  2/19/91 

ASSUMES  2424  KM  SKY  DOME  AND  WAVELENGTH- 256R  THAT  WAS  USED  IN  BAA 
EDGE  IS  DISTANCE( IN  WAVELENGTHS)  ALONG  EDGE  OF  SQUARE  WITH  2424  KM  AREA 
R-SCALE  DISTANCE  (KM) 

RBAR- RETURN  AVERAGE  CORRELATION  (ROBAR) 

********************************** 

DATA  N/21/ 

C  EDGE“SQRT( 24  24.)/(256.*R) 

EDGE-. 19232/R 

C  SUM= ( 21  *  21 )/4  =  110.25 

SUM-110.25 
DX-EDGE/(N-1) 

DO  20  1-1 , N~1 
DO  10  J-0,N-1 

SIG“SQRT( FLOAT ( I )  *  * 2+FLOAT( J) *  *  2 ) *DX 
D-SIG  -  INT(SIG) 

C-l .  ~3*D  -t-2  .  *D*D 
IF( SIG .GT. 1 . )  C  -  D*C/SIG 
SUM-SUM  +  (N-I)*(N-J)*C 
10  CONTINUE 
20  CONTINUE 

RBAR- 4 . *SUM/N*  *  4 

RETURN 

END 
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